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9 A^Ecation No.: 09455978 

NOTICE TO COMPLY WITH REQUIREMENTS FOR PATENT APPLICATIONS CONTAINING 
NUCLEOTIDE SEQUENCE AND/OR AMINO ACID SEQUENCE DISCLOSURES 

Applicant must file the items indicated below within the time period set the Office action to which the 
Notice is attached to avoid abandonment under 35 U.S.C. § 133 (extensions of time may be obtained 
under the provisions of 37 CFR 1.136(a)). 

cleotide and/or amino acid sequence disclosure contained in this application does not comply with 
* ^merits for such a disclosure as set forth in 37 C.F.R. 1.821 - 1.825 for the following reason(s): 

This application clearly fails to comply with the requirements of 37 C.F.R. 1 .821-1 .825. Applicant's 
(attention is directed to the final rulemaking notice published at 55 FR 18230 (May 1, 1990), and 1114 
OG 29 (May 1 5, 1 990). If the effective filing date is on or after July 1 , 1 998, see the final rulemaking 
notice published at 63 FR 29620 (June 1, 1998) and 1211 OG 82 (June 23, 1998). 

This application does not contain, as a separate part of the disclosure on paper copy, a "Sequence 
— ' Listing" as required by 37 C.F.R. 1 .821 (c). 

□ 3. A copy of the "Sequence Listing" in computer readable form has not been submitted as required by 
37 C.F.R. 1.821(e). 

[X~l 4. A copy of the "Sequence Listing" in computer readable form has been submitted. However, the content 

— of the computer readable form does not comply with the requirements of 37 C.F.R. 1 .822 and/or 1 .823, 
as indicated on the attached copy of the marked -up "Raw Sequence Listing." 

I I 5 The computer readable form that has been filed with this application has been found to be damaged 

— and/or unreadable as indicated on the attached CRF Diskette Problem Report. A Substitute computer 
readable form must be submitted as required by 37 C.F.R. 1 .825(d). 

[ | 6. The paper copy of the "Sequence Listing" is not the same as the computer readable from of the 
"Sequence Listing" as required by 37 C.F.R. 1 .821 (e). 

| | L Other: _ 

Applicant Must Provide: 

[x] An initial or substitute computer readable form (CRF) copy of the "Sequence Listing" 

0 

0A statement that the content of the paper and computer readable copies are the same and, where applicable 
include no new matter, as required by 37 C.F.R. 1 .821 (e) or 1 .821 (f) or 1 .821 (g). 

For questions regarding compliance to these requirements, please contact: 
. For Rules Interpretation, call (703) 308-4216 or (703) 308-2923 

• For CRF Submission Help, call (703) 308-4212 

• For Patentln software Program Support: 

• HELP DESK: (703) 739-8559, ext 508, M-F, 8 AM to 5 PM EST except holidays 

. Email: PATIN21HELP@uspto.gov 

. To purchase Patentln sofftware: (703)306-2600 



An initial or substitute paper copy of the "Sequence Listing", as well as an amendment directing its entry into 
the specification. 



PLEASE RETURN A COPY OF THIS NOTICE WITH YOUR RESPONSE 



# 



BIOTESaFOLOCY 

SYSTEMS 
BRANCH 



RAW SEQUENCE LISTING 
ERROR REPORT 




The Biotechnology Systems Branch of the Scientific and Technical Information 
Center (STIC) delected errors when processing the following computer readable 
form: 

Application Serial Number: 6^ j ,Cf ' ^jf^ 

QlPh : 



Source: 
Date Processed by STIC: 



THE ATTACHED PRINTOUT EXTLAINS DETECTED ERRORS. 

PLEASE FORWARD THIS INFORMATION TO TIIE APPLICANT BY EITHER: 

1) INCLUDING A COPY OF THIS PRINTOUT IN YOUR NEXT COMMUNICATION TO THE 
APPLICANT, WITH A NOTICE TO COMPLY or, 

2) TELEPHONING APPLICANT AND FAXING A COPY OF THIS PRINTOUT, WITH A 
NOTICE TO COMPLY 

FOR CRF SUBMISSION QUESTIONS, PLEASE CONTACT MARK SPENCER, 703-308-4212. 



FOR SEQUENCE RULES INTERPRETATION, PLEASE CONTACT ROBERT WAX, 703-308-4216. 
PATENTIN 2.1 e-mail help: natin21hcln@usnto.eov or phone 703-306-4119 (R. Wax) 
PATENTIN 3.0 e-mail help: patin3hcln@usnto.gov or phone 703-306-4119 (R. Wax) 

TO REDUCE ERRORED SEQUENCE LISTINGS, PLEASE USE THE CHECKER 
VERSION 3.0 PROGRAM . ACCESSIBLE THROUGH THE U.S. PATENT AND 
TRADEMARK OFFICE WEBSITE. SEE BELOW: 



Checker Version 3.0 

The Checker Version 3.0 application is a statc-of thc-art Windows based software program 
employing a logical and intuitive user-interface to check whether a sequence listing is in 
compliance with format and content rules. Checker Version 3.0 works for sequence listings 
generated for ihc original version of 37 CFR §§1.821 - 1.825 effective October I, 1990 (old . 
rules) and the revised version (new rules) effective July I, 1998 as well as World Intellectual 
Property Organization (WIPO) Standard ST.25. 

Checker Version 3.0 replaces the previous DOS-based version of Checker, and is Y2K- 
compliant. Checker allows public users to check sequence listings in Computer Readable form 
(CRF) before submitting them to the United States Patent and Trademark Office (USPTO). 
Use of Checker prior to filing the sequence listing is expected to result in fewer crrorcd sequence 
listings, thus saving time and money. 



Checker Version 3.0 can be down loaded from the USPTO website at the following address: 



h(t|)://\vw>v.us|)(o.gov/\vcb/officcs/[)ac/chcclcci 



0 Raw Sequence Listing Error Summary 



ERROR DETECTED 
ATTN: NEW RULES CASES: 

1 Wrapped Nucleics 

Wrapped Aminos 



I: 



SUGGESTED CORRECTION SERIAL NUMBER: 

PLEASE DISREGARD ENGLISH "ALPHA* HEADERS, WHICH WERE INSERTED DV PTO SOFTWARE 

The number/text at the end of each line , ^v^appcd ,> down lo the next line. This may occur if your file 
was retrieved in a word processor after creating it Please adjust your right margin to .3; this will 
prevent "wrapping." 



Invalid Line Length The rules require that a line not exceed 72 characters in length. This includes white spaces. 



3 Misaligned Amino 

Numbering 

4 Non-ASCII 



The numbering under each 5* amino acid is misaligned. Do not use tab codes between numbers; 
use space characters, instead. 

The submitted file was not saved in ASCII(DOS) text, as required by the Sequence Rules. Please 
ensure your subsequent submission is saved in ASCII text 



Variable Length Sequences) ^contain n's or Xaa's representing more than one residue. Per Sequence Rules, 

each n or Xaa can only represent a single residue. Please present the maximum number of each 
residue having variable length and indicate in the <220>-<223> section that some may be missing. 



_Patcntln 2.0 
"bug" 



7 Skipped Sequences 

(OLD RULES) 



A "bug" in Palcntln version 2.0 has caused the <220>-<223> section to be missing from amino acid 

scqucnccs(s) . Normally, Palcntln would automatically generate this section from the 

previously coded nucleic acid sequence. Please manually copy the relevant <220>-<223> section to 
the subsequent amino acid sequence. This applies to the mandatory <220>-<223> sections for 
Artificial or Unknown sequences. 

Sequencers) missing. If intentional, please insert the following lines for each skipped sequence: 

(2) INFORMATION FOR SEQ ID NO:X: (insert SEQ CD NO where "X" is shown) 
(i) SEQUENCE CHARACTERISTICS: (Do not insert any subheadings under this heading) 
(xi) SEQUENCE DESCRiPTION:SEQ ED NO:X: (insert SEQ ID NO where "X" is shown) 
This sequence is intentionally skipped 

Please also adjust the "(ii) NUMBER OF SEQUENCES:" response to include the skipped sequences. 



Skipped Sequences 
"(NEW RULES) 



Scqucncc(s) 



<210> sequence id number 
<400> sequence id number 
000 



missing. If intentional, please insert the following lines for each skipped sequence. 



10 



II 



12 



Use of n*s or XaaV Use of n*s and/or Xaa's have been detected in the Sequence Listing. 

"(NEW RULES) Per 1.823 of Sequence Rules, use of <220>-<223> is MANDATORY if n's or Xaa's arc present. 

In <220> to <223> section, please explain location of n or Xaa, and which residue n or Xaa represents. 



_lnvalid <213> 
Response 

Uscof<220> 



.PalenUn2.0 . 
"bug" 



Per 1.823 of Sequence Rules, the only valid <2I3> responses arc: Unknown, Artificial Sequence, or 
scientific name (Gcnus/spccics). <220>-<223> section is required when <21 3> response is Unknown or 
is Artificial Sequence 



Sequences) 



missing the <220> "Feature" and associated numeric identifiers and responses. 



Use of <220> to <223> is MANDATORY if <21 3> "Organism" response is "Artificial Sequence" or 

"Unknown." Please explain source of genetic material in <220> to <223> section. 

(Sec "Federal Register," 06/01/1998, Vol. 63, No. 104, pp. 29631-32) (Sec. 1.823 of Sequence Rules) 

Please do not use "Copy to Disk" function of Patentln version 2.0. This causas a corrupted file, 
resulting in missing mandatory numeric identifiers and responses (as indicated on raw sequence 
listing). Instead, please use "File Manager" or any other manual means to copy file to floppy disk. 



AMC - Biotechnology Systems Branch - 06/04/2001 




RAW SEQUENCE LISTING 

PATENT APPLICATION: US/09/4 5 5 , 978 



Page 1 of 7 



OIPE 



DATE: 
TIME: 



07/07/2001 
13:15:00 



Input Set : A:\H1020011.app 

Output Set: N:\CRF3\07062001\I455978.raw 



Does Not Comply 
Corrected Diskette Needed 



3 <110> APPLICANT: Alam, Maqsudul 

4 Larsen, Randy 

6 <120> TITLE OF INVENTION: HEME PROTEINS HEMAT-HS AND HEMAT-BS AND THEIR USE 

7 MEDICINE AND MICROSENSORS 

9 <130> FILE REFERENCE: 201040/1020 

11 <140> CURRENT APPLICATION NUMBER: 09/455,978 

12 <141> CURRENT FILING DATE: 1999-12-06 
14 <160> NUMBER OF SEQ ID NOS : 86 
16 <170> SOFTWARE: Patentln Ver. 2.1 

18 <210> SEQ ID NO: 1 

19 <211>' LENGTH: 1470 

20 <212> TYPE: DNA 

21 <213> ORGANISM: Halobacter ium salinarum 
2 3 <4 00> SEQUENCE: 1 

24 atgagcaacg ataatgacac tctcgtgacc gccgacgttc ggaacgggat cgacgggcac 60 

25 gcactcgcgg accggatcgg cctcgacgag gcggagatcg cgtggcggct gtcgttcacc 120 

26 gggatcgacg acgacacgat -ggccgcgctc gccgccgaac agccgctgtt cgaagccacc 180 

27 gcggacgcgc tggtgaccga cttctacgac cacttggagt cctacgagcg cacacaggac 240 

28 ctcttcgcga actccacgaa gaccgtcgag caactcaaag agacgcaggc cgagtacttg 300 

29 ctgggcctcg ggcgcggcga gtacgacacc gagtacgccg cccagcgcgc ccgtatcggg 360 

30 aagatacacg acgtgctcgg gctcggaccg gacgtctatc tgggcgcgta cacgcgatac 420 

31 tacacggggc tgttggacgc gcttgccgac gacgtggtcg ccgaccgcgg cgaggaggcg 480 

32 gccgccgccg tcgacgaact cgtggcccgg ttcctgccga tgttgaagct gttgaccttc 54 0 

33 gatcagcaga tcgcaatgga cacctacatc gactcgtacg cccagcgcct ccacgacgag 600 

34 atcgacagcc gccaggagtt ggcgaacgcg gtcgccacgc acgtggaagc accgctgtcc 660 

35 tcgctggagg cgacctcgca ggacgtcgcc gagcgcacgg acacgatgcg ggcccgcacc 720 

36 gacgaccagg tcgaccgcat ggctgacgtc agccgtgaga tatccagcgt gtccgcgagc 780 

37 gtcgaggagg tcgcctcgac ggccgacgac gtccgccgga ccagcgagga cgccgaggcg 840 

38 ctggcccagc agggcgaggc ggccgccgac gacgcgctcg ccacgatgac cgacatcgac 900 

39 gaggcgaccg acggcgtcac cgcgggcgtc gaacagctcg gcgagcgcgc cgccgacgtc 960 

40 gaatcagtga ccggcgtgat cgacgacatc gccgagcaga cgaacatgct ggcgctgaac 1020 

41 gcgtccatcg aggccgcccg cgccggggag gcgggcgagg ggtttgcggt cgtcgccgac 1080 

42 gaggtcaagg ccctcgccga ggagtcccgc gagcagtcca cgcgcgtcga ggagctcgtc 1140 
4 3 gagcagatgc aggcggagac cgaggagacg gtcgaccagt tggacgaggt caaccagcgc 1200 

44 atcggcgagg gcgtcgagcg cgtcgaggag gcgatggaga ccctccagga gatcaccgac 1260 

45 gccgtcgagg acgccgcaag cgggatgcag gaggtgtcga cggcgaccga cgaacaggcg 1320 
4 6 gtgagcaccg aggaggtcgc cgagatggtc gacggtgtcg acgaccgcgc gggcgagatc 1380 
4 7 gcggccgccc tcgatgacat cgccgacgcg accgatcagc aggtccggac cgtcgaggag 1440 
4 8 gtccgcgaga cggtcggcaa gctcagctag 14 70 

51 <210> SEQ ID NO: 2 

52 <211> LENGTH: 489 

53 <212> TYPE: PRT 

54 <213> ORGANISM: Halobacterium salinarum 

56 <400> SEQUENCE: 2 

57 Met Ser Asn Asp Asn Asp Thr Leu Val Thr Ala Asp Val Arg Asn Gly 

58 1 5 10 15 
60 lie Asp Gly His Ala Leu Ala Asp Arg lie Gly Leu Asp Glu Ala Glu 



IN 



nie://C:\CRF3\OuthoId\VsrI455978.htm 



7/7/01 



RAW SEQUENCE LISTING DATE: 07/07/2001 

PATENT APPLICATION: US/09/4 55,978 TIME: 13:15:00 

Input Set : A:\H1020011.app 

Output Set: N:\CRF3\07062001\I455978.raw 
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61 








20 










25 










30 






63 


lie 


Ala 


Trp 


Arg 


Leu 


Ser 


Phe 


Thr 


Gly 


lie 


Asp 


Asp 


Asp 


Thr 


Met 


Ala 


.64 






35 










40 










45 








66 . 


Ala 


Leu 


Ala 


Ala 


Glu 


Gin 


Pro 


Leu 


Phe 


Glu 


Ala 


Thr 


Ala 


Asp 


Ala 


Leu 


67 




50 










55 










60 










69 


Val 


Thr 


Asp 


Phe 


Tyr 


Asp 


His 


Leu 


Glu 


Ser 


Tyr 


Glu 


Arg 


Thr 


Gin 


Asp 


70 


65 










70 










75 










80 


72 


Leu 


Phe 


Ala 


Asn 


Ser 


Thr 


Lys 


Thr 


Val 


Glu 


Gin 


Leu 


Lys 


Glu 


Thr 


Gin 


73 










85 










90 










95 




75 


Ala 


Glu 


Tyr 


Leu 


Leu 


Gly 


Leu 


Gly 


Arg 


Gly 


Glu 


Tyr 


Asp 


Thr 


Glu 


Tyr 


76 








100 










105 










U0 






78 


Ala 


Ala 


Gin 


Arg 


Ala 


Arg 


He 


Gly 


Lys 


He 


His 


Asp 


Val 


Leu 


Gly 


Leu 


79 






115 










120 










125 








81 


Gly 


Pro 


Asp 


val 


Tyr 


Leu 


Gly 


Ala 


Tyr 


Thr 


Arg 


Tyr 


Tyr 


Thr 


Gly 


Leu 


82 




130 








'135 










140 










84 


Leu 


Asp 


Ala 


Leu 


Ala 


Asp 


Asp 


Val 


Val 


Ala 


Asp 


Arg 


Gly 


Glu 


Gla 


Ala 


85 


145 










150 










155 










160 


87 


Ala 


Ala 


Ala 


Val 


Asp 


Glu 


Leu 


Val 


Ala 


Arg 


Phe 


Leu 


Pro 


Met 


Leu 


Lys 


88 










165 










170 










175 




90 


Leu 


Leu 


Thr 


Phe 


Asp 


Gin 


Gin 


He 


Ala 


Met 


Asp 


Thr 


Tyr 


He 


Asp 


Ser 


91 








180 










185 










190 






93 


Tyr 


Ala 


Gin 


Arg 


Leu 


His 


Asp 


Glu 


He 


Asp 


Ser 


Arg 


Gin 


Glu 


Leu 


Ala 


94 






195 










200 










205 








96 


Asn 


Ala 


Val 


Ala 


Thr 


His 


Val 


Glu 


Ala 


Pro 


Leu 


Ser 


Ser 


Leu 


Glu 


Ala 


97 




210 










215 










220 










99 


Thr 


Ser 


Gin 


Asp 


Val 


Ala 


Glu 


Arg 


Thr 


Asp 


Thr 


Met 


Arg 


Ala 


Arg 


Thr 


100 


225 










230 










235 










240 


102 


Asp 


Asp 


Gin 


Val 


Asp 


Arg 


Met 


Ala 


Asp 


Val 


Ser 


Arg 


Glu 


He 


Ser 


Ser 


103 










24 5 










250 










255 




105 


Val 


Ser 


Ala 


Ser 


Val 


Glu 


Glu 


Val 


Ala 


Ser 


Thr 


Ala 


Asp 


Asp 


Val 


Arg 


106 








260 










265 










270 






108 


Arg 


Thr 


Ser 


Glu 


Asp 


Ala 


Glu 


Ala 


Leu 


Ala 


Gin 


Gin 


Gly 


Glu 


Ala 


Ala 


109 






275 










280 










285 








111 


Ala 


Asp 


Asp 


Ala 


Leu 


Ala 


Thr 


Met 


Thr 


Asp 


He 


Asp 


Glu 


Ala 


Thr 


Asp 


112 




290 










295 










300 










114 


Gly 


Val 


Thr 


Ala 


Gly 


Val 


Glu 


Gin 


Leu 


Gly 


Glu 


Arg 


Ala 


Ala 


Asp 


Val 


115 


305 










310 










315 










320 


117 


Glu 


Ser 


Val 


Thr 


Gly 


Val 


He 


Asp 


Asp 


He 


Ala 


Glu 


Gin 


Thr 


Asn 


Met 


118 










325 










330 










335 




120 


Leu 


Ala 


Leu 


Asn 


Ala 


Ser 


He 


Glu 


Ala 


Ala 


Arg 


Ala 


Gly 


Glu 


Ala 


Gly 


121 








340 










345 










350 






123 


Glu 


Gly 


Phe 


Ala 


Val 


Val 


Ala 


Asp 


Glu 


Val 


Lys 


Ala 


Leu 


Ala 


Glu 


Glu 


124 






355 










360 










- 365 








126 


Ser 


Arg 


Glu 


Gin 


Ser 


Thr 


Arg 


Val 


Glu 


Glu 


Leu 


Val 


Glu 


Gin 


Met 


Gin 


127 




370 










375 










380 










129 


Ala 


Glu 


Thr 


Glu 


Glu 


Thr 


Val 


Asp 


Gin 


Leu 


Asp 


Glu 


Val 


Asn 


Gin 


Arg 


130 


385 










390 










395 










400 


132 


He 


Gly 


Glu 


Gly 


Val 


Glu 


Arg 


Val 


Glu 


Glu 


Ala 


Met 


Glu 


Thr 


Leu 


Gin 


133 










405 










410 










415 
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RAW SEQUENCE LISTING 

PATENT APPLICATION: US/09/4 55 , 97 8 



DATE: 07/07/2001 
TIME: 13:15:00 



Input Set : A:\H1020011.app 

Output Set: N:\CRF3\07062001\I455978.raw 



135 Glu lie Thr Asp Ala Val Glu Asp Ala Ala Ser 

136 420 425 

.138 Ser Thr Ala Thr Asp Glu Gin Ala Val Ser Thr 

139 435 440 

141 Met Val Asp Gly Val Asp Asp Arg Ala Gly Glu 

142 450 455 

144 Asp Asp lie Ala Asp Ala Thr Asp Gin Gin Val 

145 465 470 475 

147 Val Arg Glu Thr Val Gly Lys Leu Ser 

148 485 

151 <210> SEQ ID NO: 3 

152 <211> LENGTH: 1390 

153 <212> TYPE: DNA 

154 <213> ORGANISM: Bacillus subtilis 

156 <400> SEQUENCE: 3 

157 atgttattta aaaaagacag aaaacaagaa acagcttact 

158 caaaaaaacc gcattcagct cacaaacaaa catgcagatg 

159 gtcaggttgg gagatgctga gctttatgtg ttagagcagc 

160 aatatcgtaa atatcgtcga tgcgttttat aaaaaccttg 

161 gatatcatta atgatcacag ctcagttgac cgcttaaaac 

162 caggaaatgt ttgcaggcgt tatcgatgat gaatttattg 

163 tccatccatt taagaatcgg ccttttgcca aaatggtata 

164 cttttgtcaa tgattgacat ttatgaagcg tccattacaa 

165 gccattaaag caacaacaaa aatcttgaac ttagaacagc 

166 caaagcgagt acaaccagac ccgtgatgaa caagaagaaa 

167 aaaattcaag aaacctctgg atcgattgcc attctgtttt 

168 caagagcttg tggaeaaatc tgaaggeatt tctcaagcat 

169 tccagcactg ttgaagaaaa gtcgatcggc ggaaaaaaag 

170 cagatgaaca aaattgacac aagccttgtc caaatcgaaa 

171 gaaatcgcgc agcaaattga aaaaatcttc ggcatcgtca 

172 aaccttctct cgctcaatgc atctattgaa tccgcccgcg 
17 3 tttgctgtcg tggcaaatga agtgcggaag ctttctgagg 

174 actgtttctg agcttgtgaa caatacgaat acacaaatca 

175 aaagacgtga atgagctagt cagcgaaagt aaagaaaaaa 

176 ttcgatgaaa tcgtccacag catgaaaatc agcaaagagc 

177 gatctgcaag cctttcttgg agggcttcag gaagtcagcc 

178 gcttccgttg attcgcttgt catcctgaca gaagaataac 

179 tacggccggt ttttttgcgt tcattatgta aacttaaatt 

180 attacctgca 

183 <210> SEQ ID NO: 4 

184 <211> LENGTH: 432 

185 <212> TYPE: PRT 

186 <213> ORGANISM: Bacillus subtilis 

188 <400> SEQUENCE: 4 

189 Met Leu Phe Lys Lys Asp Arg Lys Gin Glu Thr 

190 15 10 

192 Ser Asn Gly Gin Gin Lys Asn Arg lie Gin Leu 

193 20 25 

195 Asp Val Lys Lys Gin Leu Lys Met Val Arg Leu 



Gly Met Gin Glu Val 
4 30 

Glu Glu Val Ala Glu 
445 

lie Ala Ala Ala Leu 
460. 

Arg Thr Val, Glu Glu 
480 



tttcagattc 
tcaaaaaaca 
ttcagccacfe 
accatgaaag 
aaacgttaaa 
aaaagcgtaa 
tgggtgcgtt 
atcagcaaga 
agcttgtcct 
agaaaaacct 
cagaaacaag 
ccaaagccgg 
agctagaagt 
aagaaatggt 
caggcatagc 
ccggagaaca 
atacgaaaaa 
acattgtatc 
tgacgcaaat 
aatcaggcaa 
gcgccgtttc 
catcaaaaac 
aaaaatcagt 



aaacggaca^a 
gctcaaaatg 
cattcaagaa 
ctcattgatg 
acggcatatt 
ccgaatcgcc 
tcaagagctc 
actgctaaaa 
tgaagcgttt 
tcttcatcag 
cagatcagtt 
cactgtaaca 
ccagcaaaaa 
caagctggat 
tgaaca.aaca 
cggcaaaggc 
aaccgtctct 
caagcatatc 
taaccgctta 
aatcgacgtc 
ccatgtggcc 
cggtctgcca 
tgacataata 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

7 20 

780 , 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1390 



Ala Tyr Phe Ser Asp 
15 

Thr Asn Lys His Ala 
30 

Gly Asp Ala Glu Leu 
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196 



35 



40 



45 



198 Tyr Val Leu Glu Gin Leu Gin Pro Leu lie Gin Glu Asn lie Val Asn 

199 50 55 60 

201 lie Val Asp Ala Phe Tyr Lys Asn Leu Asp His Glu Ser Ser Leu Met 

202 65 70 75 80 

204 Asp lie lie Asn Asp His Ser Ser Val Asp Arg Leu^ Lys Gin Thr Leu 

205 85 90 95 

207 Lys Arg His lie Gin Glu Met Phe Ala Gly Val lie Asp Asp Glu Phe 

208 100 105 110 

210 lie Glu Lys Arg Asn Arg lie Ala Ser lie His Leu Arg lie Gly Leu 

211 115 120 125 , 

213 Leu Pro Lys Trp Tyr Met Gly Ala Phe Gin Glu Leu Leu Leu Ser Met 

214 130 135 140 

216 lie Asp lie Tyr Glu Ala Ser lie Thr Asn Gin Gin Glu Leu Leu Lys 

217 145 150 155 160 

219 Ala lie Lys Ala Thr Thr Lys lie Leu Asn Leu Glu Gin Gin Leu Val 

220 165 170 175 

222 Leu Glu Ala Phe Gin Ser Glu Tyr Asn Gin Thr Arg Asp Glu Gin Glu 

223 180 185 190 

225 Glu Lys Lys Asn Leu Leu His Gin Lys lie Gin Glu Thr Ser Gly Ser 

226 195 200 205 

228 lie Ala Asn Leu Phe Ser Glu Thr Ser Arg Ser Val Gin Glu Leu Val 

229 210 215 220 

231 Asp Lys Ser Glu Gly lie Ser Gin Ala Ser Lys Ala Gly Thr Val Thr 

232 225 230 235 240 

234 Ser Ser Thr Val Glu Glu Lys Ser lie Gly Gly Lys Lys Glu Leu Glu 

235 245 * 250 255 

237 Val Gin Gin Lys Gin Met Asn Lys lie Asp Thr Ser Leu Val Gin He 

238 260 265 „ 270 

240 Glu Lys Glu Met Val Lys Leu Asp Glu lie Ala Gin Gin He Glu Lys 

241 275 280 285 

243 He Phe Gly He Val Thr Gly He Ala Glu Gin Thr Asn Leu Leu Ser 

244 290 295 300 

246 Leu Asn Ala Ser He Glu Ser Ala Arg Ala Gly Glu His Gly Lys Gly 

247 305 310 315 320 

249 Phe Ala Val Val Ala Asn Glu Val Arg Lys Leu Ser Glu Asp Thr Lys 

250 325 330 335 

252 Lys Thr Val Ser Thr Val Ser Glu Leu Val Asn Asn Thr Asn Thr Gin 

253 340 345' 350 

255 He Asn He Val Ser Lys His He Lys Asp Val Asn Glu Leu Val Ser 

256 355 360 365 

258 Glu Ser Lys Glu Lys Met Thr Gin He Asn Arg Leu Phe Asp Glu He 

259 370 375 380 

261 Val His Ser Met Lys He Ser Lys Glu Gin Ser Gly Lys He Asp Val 

262 385 390 395 400 

264 Asp Leu Gin Ala Phe Leu Gly Gly Leu Gin Glu Val Ser Arg Ala Val 

265 405 410 - 415 

267 Ser His Val Ala Ala Ser Val Asp Ser Leu Val He Leu Thr Glu Glu 

268 420 425 430 



0 
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274 
275 
276 
277 
279 
280 
281 
283 
284 
285 
286 
288 

*--> 289 
290 

*--> 292 
293 

*--> 295 
296 

*--> 298 
299 
302 
303 
304 
305 
307 
308 
309 
311 
312 
314 
315 
317 
318 
321 
322 
323 
324 
326 
327 
328 
330 
331 
333 
334 
336 
337 
340 
341 
342 



<210> SEQ ID NO: 5 
<211> LENGTH: 57 
<212> TYPE: PRT 

<213> ORGANISM : Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: Description of Artificial Sequence: Template 

sequence . - . / / / 

<220> FEATURE : X2lZ*7 ^GOT *0 

<221> NAME/KEY: UNSURE j- r Xa^/ S^/v~~ e u$€ o 
<222> LOCATION: (4) — C^cfC^ttt Vt^- loc*-k^ ii 
<223> OTHER INFORMATION: X at any position in this sequence is unknown. 
<400> SEQUENCE: 5 

lie lie Lys Xaa Thr Val Pro Val Leu Xaa Glu His Gly Xaa Xaa lie 

15 10 15 

Gly Gin Asp Val Leu Val* Val Leu lie Lys Xaa Asn Pro Glu lie Gin 

20 25 30 

Glu Lys Phe Phe Phe Phe Lys His Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

35 40 45 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

50 55 
<210> SEQ ID NO: 6 
<211> LENGTH: 55 
<212> TYPE: PRT 

<213> ORGANISM: Erwinia chrysanthemi 
<400> SEQUENCE: 6 

lie Lys Ser Thr lie Pro Leu Leu Ala Glu Thr Gly Pro Ala Leu Thr 

15 10 15 

Ala His Phe Tyr Gin Arg Met Phe His His Asn Pro Glu Leu Lys Asp 

20 25 30 

lie Phe Asn Met Ser Asn Gln'Arg Asn Gly Asp Gin Arg Glu Ala Leu 

35 40 45 

Phe Asn Ala lie Cys Ala Tyr 
50 55 
<210> SEQ ID NO: 7 
<211> LENGTH: 56 
<212> TYPE: PRT 

<213> ORGANISM: Vitreoscilla stercoraria 
<400> SEQUENCE: 7 

lie lie Lys Ala Thr Val Pro Val Leu Lys Glu His Gly Val Thr lie 

15 10 15 

Thr Thr Thr Phe Tyr Lys Asn Leu. Phe Ala Lys His Pro Glu Val Arg 

20 25 30 

Pro Leu Phe Asp Met Gly Arg Gin Glu Ser Leu Glu Gin Pro Lys Ala 



* 



35 



40 



45 



Leu Ala Met Thr Val Leu Ala Ala 



50 

<210> SEQ ID NO: 8 
<211> LENGTH: 55 
<212> TYPE: PRT 



55 



Use of n and/or Xaa lias been delected in the Sequence Listing. 
Review the Sequence Listing to insure a corresponding 
explanation is presented in the <220> to <223> fields of 
each sequence using n or Xaa. 
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289 
292 
295 
298 
752 
755 
934 
952 
106 
170 
170 
172 
174 
176 
178 



M:341 W 

M:341 W 

M:341 W 

M:341 W 

M:341 W 

M:341 W 

M:341 W 

M:341 W 
1 M:341 



W 

M:341 W 
M:341 W 
341 W 
341 W 
341 W 
341 W 



(46) ' 
(46) ' 
(46) ' 
(46) ' 
(46) 1 
(46) ' 
(46) « 
(46) 1 
(46) 
(46) 
(46) 
(46) 
(46) 
(46) 
(46) 



n 
n 
n 
n 
n 
n 
n 
n 

n n n 

"n" 
"n" 
"n" 



or " 
or " 
or " 
or " 
or n 
or n 
or " 
or n 
or 
or 
or 
or 
or 
or 
or 



Xaa" 
Xaa" 
Xaa" 
Xaa" 
Xaa" 
Xaa" 
Xaa" 
Xaa" 
"Xaa 
"Xaa 
"Xaa 
"Xaa 
"Xaa 
"Xaa 
"Xaa 



used, 
used, 
used, 
used, 
used, 
used, 
used, 
used, 
' used, 
' used, 
' used, 
' used, 
' used, 
' used, 
' used, 



for SEQ IDS : 5 
for SEQ IDS: 5 
for SEQ IDS :5 
for SEQ IDS : 5 
for SEQ ID#:30 
for SEQ IDS : 30 
for SEQ IDS: 41 
for SEQ IDft :42 



for SEQ ID# 
for SEQ ID# 
for SEQ ID# 
for SEQ ID# 
for SEQ ID# 
for SEQ ID# 
for SEQ IDS 



:50 
:82 
:82 
:83 
:84 
:85 
:86 
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